Phoneme recognition in fixed context using regularized discriminant analysis

نویسندگان

Algimantas Rudzionis

Vytautas Rudzionis

چکیده

Speaker independent discrimination of four confusable consonants in the strictly fixed context of six vowels is considered. The consonants are depicted by features of consonant’s stationary part and changing rate of features (delta features) in transition from consonant to the following vowel. The mel frequency cepstrum (MFCC), linear prediction cepstrum (LPCC), recursive filter (F12) features and set of discriminants were evaluated seeking for better phoneme discrimination. It is postulated that Gaussian mixture capabilities are similar to k-means (kMN) capabilities and several discriminants including regularized discriminant analysis (RDA) were analyzed too. The experiments showed that the discrimination error averaged per environments of six vowels decreases from 23.3% using kMN to 7.0% using RDA for the best F12 features. Consonant discrimination error rate decreases from 21.6% to 3.6% in the open vowel context and from 27.9% to 11.4% in closed vowel context.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

متن کامل

Discriminant spectrotemporal features for phoneme recognition

We propose discriminant methods for deriving twodimensional spectrotemporal features for phoneme recognition that are estimated to maximize the separation between the representations of phoneme classes. The linearity of the filters results in their intuitive interpretation enabling us to investigate the working principles of the system and to improve its performance by locating the sources of e...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

Regularized discriminant analysis for face recognition

This paper studies Regularized Discriminant Analysis (RDA) in the context of face recognition. We check RDA sensitivity to different photometric preprocessing methods and compare its performance to other classifiers. Our study shows that RDA is better able to extract the relevant discriminatory information from training data than the other classifiers tested, thus obtaining a lower error rate. ...

متن کامل